Reverberant speech recognition exploiting clarity index estimation

Authors

  • Pablo Peso Parada
  • Dushyant Sharma
  • Patrick A. Naylor
  • Toon van Waterschoot
Abstract

We present single-channel approaches to robust automatic speech recognition (ASR) in reverberant environments based on non-intrusive estimation of the clarity index (C50). Our best performing method includes the estimated value of C50 in the ASR feature vector and also uses C50 to select the most suitable ASR acoustic model according to the reverberation level. We evaluate our method on the REVERB Challenge database employing two different C50 estimators and show that our method outperforms the best baseline of the challenge achieved without unsupervised acoustic model adaptation, i.e. using multi-condition hidden Markov models (HMMs). Our approach achieves a 22.4% relative word error rate reduction in comparison to the best baseline of the challenge.
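The two uses of C50 named in the abstract (appending it to the feature vector and using it to choose an acoustic model) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not the authors' implementation: C50 is computed intrusively from a known room impulse response using the standard early-to-late energy ratio with a 50 ms split, whereas the paper estimates it non-intrusively from the speech signal. The function names, the utterance-level granularity, the decision boundaries (10 dB and 20 dB) and the model labels are all hypothetical.

```python
import math


def clarity_index_c50(rir, fs, early_ms=50.0):
    """Return C50 in dB for a room impulse response sampled at fs Hz.

    C50 is the ratio of the energy in the first 50 ms after the direct
    path to the energy arriving later. Assumption: the direct path is
    the largest-magnitude sample of `rir`.
    """
    direct = max(range(len(rir)), key=lambda i: abs(rir[i]))
    split = direct + int(round(early_ms * 1e-3 * fs))
    early = sum(x * x for x in rir[direct:split])
    late = sum(x * x for x in rir[split:])
    if early == 0.0:
        raise ValueError("no early energy; check rir and fs")
    if late == 0.0:
        return math.inf  # no late reverberation at all
    return 10.0 * math.log10(early / late)


def append_c50(frames, c50_db):
    """Append one utterance-level C50 value to every frame's feature vector."""
    return [list(frame) + [c50_db] for frame in frames]


def select_model(c50_db,
                 boundaries=(10.0, 20.0),
                 names=("high_reverb", "mid_reverb", "low_reverb")):
    """Pick an acoustic model label from the C50 range.

    The boundaries and labels are hypothetical placeholders.
    """
    for bound, name in zip(boundaries, names):
        if c50_db < bound:
            return name
    return names[-1]


# Toy impulse response at 1 kHz: direct path, then one late reflection.
toy_rir = [1.0] + [0.0] * 49 + [0.1]
toy_c50 = clarity_index_c50(toy_rir, fs=1000)
toy_features = append_c50([[0.5, -1.2], [0.7, -0.9]], toy_c50)
toy_model = select_model(toy_c50)
```

A lower C50 means more late reverberation relative to early energy, so the sketch maps low values to the model trained for the most reverberant condition.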


Similar resources

Combining articulatory and acoustic information for speech recognition in noisy and reverberant environments

Robust speech recognition under varying acoustic conditions may be achieved by exploiting multiple sources of information in the speech signal. In addition to an acoustic signal representation, we use an articulatory representation consisting of pseudoarticulatory features as an additional information source. Hybrid ANN/HMM recognizers using either of these representations are evaluated on a co...


Blind estimation of room acoustic parameters using kernel regression

Room acoustic parameters are key information for dereverberation or speech recognition. Usually, when one needs to assess the level of reverberation, only the reverberation time RT60 or a direct to reverberant sounds index Dτ is estimated. Yet, methods which blindly estimate the reverberation time from reverberant recorded speech do not always differentiate the RT60 from the Dτ to evaluate the ...


Model-based blind estimation of reverberation time: application to robust ASR in reverberant environments

This paper presents a method for blind estimation of reverberation times in reverberant enclosures. The proposed algorithm is based on a statistical model of short-term log-energy sequences for echo-free speech. Given a speech utterance recorded in a reverberant room, it computes a Maximum Likelihood estimate of the room full-band reverberation time. The estimation method is shown to require li...


Improve Speech Recognition Performance in Reverberant Environment Based on Estimation of Energy Feature

The purpose of this paper is to improve speech recognition performance in reverberant environments with distant talking, based on direct speech processing or a selected log-energy feature. Speech recognition performance can be improved by changing the value of the log-energy feature or the speech energy directly. Experiments used the CENSREC-4 corpus to evaluate distant-talking speech under various re...


Performance estimation of reverberant speech recognition based on reverberant criteria RSR-dn with acoustic parameters

Reverberation-robust speech recognition has become very important in the field of distant-talking speech recognition. However, as no common reverberation criteria for the recognition of reverberant speech have yet been proposed, it has been difficult to estimate its effectiveness. To address this problem in 2007, we investigated early and late reflections on distant-talking speech recognition to...



Journal:
  • EURASIP J. Adv. Sig. Proc.

Volume 2015

Publication date: 2015